High-throughput functional testing of ENCODE segmentation predictions.

Authors

  • Jamie C Kwasnieski
  • Christopher Fiore
  • Hemangi G Chaudhari
  • Barak A Cohen
Abstract

The histone modification state of genomic regions is hypothesized to reflect the regulatory activity of the underlying genomic DNA. Based on this hypothesis, the ENCODE Project Consortium measured the status of multiple histone modifications across the genome in several cell types and used these data to segment the genome into regions with different predicted regulatory activities. We measured the cis-regulatory activity of more than 2000 of these predictions in the K562 leukemia cell line. We tested genomic segments predicted to be Enhancers, Weak Enhancers, or Repressed elements in K562 cells, along with other sequences predicted to be Enhancers specific to the H1 human embryonic stem cell line (H1-hESC). Both Enhancer and Weak Enhancer sequences in K562 cells were more active than negative controls, although surprisingly, Weak Enhancer segmentations drove expression higher than did Enhancer segmentations. Lower levels of the covalent histone modifications H3K36me3 and H3K27ac, thought to mark active enhancers and transcribed gene bodies, associate with higher expression and partly explain the higher activity of Weak Enhancers over Enhancer predictions. While DNase I hypersensitivity (HS) is a good predictor of active sequences in our assay, transcription factor (TF) binding models need to be included in order to accurately identify highly expressed sequences. Overall, our results show that a significant fraction (~26%) of the ENCODE enhancer predictions have regulatory activity, suggesting that histone modification states can reflect the cis-regulatory activity of sequences in the genome, but that specific sequence preferences, such as TF-binding sites, are the causal determinants of cis-regulatory activity.

Similar resources

Improving microRNA target prediction in humans using a highly descriptive graph-based machine-learning model

Computational prediction of animal microRNA target sites imposes a tough challenge on research, since complementarity of functional microRNA-target interactions is usually small, which inevitably leads to a high number of false positive predictions. Prediction programs try to cope with this dilemma by applying additional filtering, but still their current performances are far from optimal. In t...

Identification and annotation of non-protein coding RNAs

Of the 3.3 billion bases of the human genome, only about 2% code for proteins. Until very recently, the remaining 98% were considered to be ’junk’ and functionless. However, large transcriptomic studies like ENCODE (ENCyclopedia Of DNA Elements) (1) or FANTOM (The Functional Annotation Of the Mammalian Genome) (2) have shown that around 90% of the genome is actively transcribed into RNA. T...

Eukaryotic membrane protein overproduction in Lactococcus lactis.

Eukaryotic membrane proteins play many vital roles in the cell and are important drug targets. Approximately 25% of all genes identified in the genome are known to encode membrane proteins, but the vast majority have no assigned function. Although the generation of structures of soluble proteins has entered the high-throughput stage, for eukaryotic membrane proteins only a dozen high-resolution...

Diagnosis of brain tumor using PNN neural networks

Cells grow and divide in an orderly way to produce new, properly functioning cells that keep the body healthy. When the ability to control cell growth is lost, cells divide uncontrollably and without order. The resulting excess cells form a tissue mass called a tumor. In fact, brain tumors are abnormal and uncontrolled cell proliferations. Segmentation methods are used in b...

Broad-Enrich: functional interpretation of large sets of broad genomic regions

MOTIVATION: Functional enrichment testing facilitates the interpretation of chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) data in terms of pathways and other biological contexts. Previous methods developed and used to test for key gene sets affected in ChIP-seq experiments treat peaks as points, and are based on the number of peaks associated with a gene or a bi...

Journal:
  • Genome Research

Volume 24, Issue 10

Publication date: 2014